Performance Analysis of Parallelizing Program Code for Multi-Core Architectures Using OPENMP
نویسندگان
چکیده
Multi-core architectures have become more popular due to better performance, reduced heat dissipation, power consumption and more efficient simultaneous processing of multi tasks. If one wishes to run a single application faster, then that application must be divided into sub programs, or threads, that cooperate to deliver the desired functionality. The Open MP programming paradigm implements loop level parallelism, which is one of the most basic available units of parallelism for parallel Open MP programs [3]. Loop-level parallelism allows an Open MP implementation to easily split the work across multiple threads. This paper mainly focuses on Parallelizing programming code using Open MP and analyzes the program performance using V-tune.
منابع مشابه
Early experiences and results on parallelizing discrete dislocation dynamics simulations on multi-core architectures
Materials science simulations are among the leading applications for scientific supercomputing. Discrete dislocation dynamics (DDD) is a numerical tool used to model the plastic behavior of crystalline materials using the elastic theory of dislocations. DDD simulations require very long running times to produce meaningful scientific results. This work presents early experiences and results on i...
متن کاملExperiences with Achieving Portability across Heterogeneous Architectures
The increasing computational needs of parallel applications inevitably require portability across popular parallel architectures, which are becoming heterogeneous. The lack of a common parallel framework results in divergent code bases, difficulty in porting, higher maintenance cost, and, thus difficulty achieving optimal performance on target architectures. Our paper examines two representativ...
متن کاملCoarse-Grain Task Parallel Processing Using the OpenMP Backend of the OSCAR Multigrain Parallelizing Compiler
This paper describes automatic coarse grain parallel processing on a shared memory multiprocessor system using a newly developed OpenMP backend of OSCAR multigrain parallelizing compiler for from single chip multiprocessor to a high performance multiprocessor and a heterogeneous supercomputer cluster. OSCAR multigrain parallelizing compiler exploits coarse grain task parallelism and near ne gra...
متن کاملTowards Autotuning of OpenMP Applications on Multicore Architectures
In this paper we describe an autotuning tool for optimization of OpenMP applications on highly multicore and multithreaded architectures. Our work was motivated by in-depth performance analysis of scientific applications and synthetic benchmarks on IBM Power 775 architecture. The tool provides an automatic code instrumentation of OpenMP parallel regions. Based on measurement of chosen hardware ...
متن کاملPerformance Analysis and Optimization of the OP2 Framework on Many-Core Architectures
This paper presents a benchmarking, performance analysis and optimization study of the OP2 ‘active’ library, which provides an abstraction framework for the parallel execution of unstructured mesh applications. OP2 aims to decouple the scientific specification of the application from its parallel implementation, and thereby achieve code longevity and near-optimal performance through re-targetin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012